Fast and Scalable Real-Time Monitoring System for Beowulf Clusters
Abstract
Fast real-time monitoring of system information is important for understanding parallel systems, especially the large cluster systems that have appeared recently. Making a monitoring system both fast and scalable remains a challenging task. This paper presents the design and implementation of a fast real-time monitoring system called SCMS/RMS, which is part of a more comprehensive cluster management tool called SCMS. SCMS/RMS is designed to be flexible, highly scalable, and efficient. The paper describes the techniques used to increase monitoring speed and achieve high scalability. Experiments were conducted on a 72-node Beowulf cluster, and the results show that SCMS/RMS is very fast and highly scalable.
Related resources
Scalable parallel FFT for spectral simulations on a Beowulf cluster
The implementation and performance of the multidimensional Fast Fourier Transform on a distributed memory Beowulf cluster is examined. We focus on the three-dimensional (3D) real transform, an essential computational component of Galerkin and pseudo-spectral codes. The approach studied is a one-dimensional domain decomposition algorithm that relies on communication-intensive transpose opera...
Beowulf – A New Hope for Parallel Computing?
The Beowulf model for clusters of commodity computers[15, 17, 18] has become very popular over the last year, particularly amongst university research groups and other organisations less able to justify large procurements. The Beowulf concept is usually applied[16] to clusters of Personal Computers running Linux, but other platforms and operating systems can also be considered as providing simi...
ACL 2 for Parallel Systems Software: A Progress Report
A significant development in high-performance computing has occurred in recent years with the proliferation of “Beowulf” clusters [6]. Beowulf clusters are parallel computers assembled from commodity-priced personal computers and networks. The explosive growth of the personal computer marketplace, together with rapid technological advances in the hardware sold there, has driven the price/perfor...
An IP-level Network Monitor and Scheduling System for Clusters
Current systems for managing workload on clusters of workstations, particularly those available for Linux-based (Beowulf) clusters, are typically based on traditional process-based, coarse-grained parallel and distributed programming. The DESPOT project is building a sophisticated thread-level resource-monitoring system for computational, storage and network resources based on SGI’s Performance...
Optimizing Latency in Beowulf Clusters
This paper discusses how to decrease and stabilize network latency in a Beowulf system. Having low latency is particularly important to reduce execution time of High Performance Computing applications. Optimization opportunities are identified and analyzed over the different system components that are integrated in compute nodes, including device drivers, operating system services and kernel pa...
Publication date: 2001